Bootstrapping populations について

Words near each other

Dictionary Lists

mini英和辞書

mini和英辞書

翻訳と辞書　辞書検索 [ 開発暫定版 ]

スポンサードリンク

Bootstrapping populations ：ウィキペディア英語版

Bootstrapping populations

Starting with a sample

\

observed from a random variable ''X'' having a given distribution law with a set of non fixed parameters which we denote with a vector

\boldsymbol\theta

, a parametric inference problem consists of computing suitable values – call them estimates – of these parameters precisely on the basis of the sample. An estimate is suitable if replacing it with the unknown parameter does not cause major damage in next computations. In Algorithmic inference, suitability of an estimate reads in terms of compatibility with the observed sample.
In this framework, resampling methods are aimed at generating a set of candidate values to replace the unknown parameters that we read as compatible replicas of them. They represent a population of specifications of a random vector

\boldsymbol\Theta

〔By default, capital letters (such as ''U'', ''X'') will denote random variables and small letters (''u'', ''x'') their corresponding realizations.〕 compatible with an observed sample, where the compatibility of its values has the properties of a probability distribution. By plugging parameters into the expression of the questioned distribution law, we bootstrap entire populations of random variables compatible with the observed sample.
The rationale of the algorithms computing the replicas, which we denote ''population bootstrap'' procedures, is to identify a set of statistics

\

exhibiting specific properties, denoting a well behavior, w.r.t. the unknown parameters. The statistics are expressed as functions of the observed values

\

, by definition. The

x_i

may be expressed as a function of the unknown parameters and a random seed specification

z_i

through the sampling mechanism

(g_,Z)

, in turn. Then, by plugging the second expression in the former, we obtain

s_j

expressions as functions of seeds and parameters – the master equations – that we invert to find values of the latter as a function of: i) the statistics, whose values in turn are fixed at the observed ones; and ii) the seeds, which are random according to their own distribution. Hence from a set of seed samples we obtain a set of parameter replicas.
== Method ==

Given a

\boldsymbol x=\

of a random variable ''X'' and a sampling mechanism

(g_,Z)

for ''X'', the realization x is given by

\boldsymbol x=\(z_m)\}

, with

\boldsymbol\theta=(\theta_1,\ldots,\theta_k)

. Focusing on well-behaved statistics,
:
for their parameters, the master equations read
: (z_m))= \rho_1(\boldsymbol\theta;z_1,\ldots,z_m)
|-
| width=90% |

\vdots\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \vdots \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \vdots

| width=10% align="center" | (1)
|-
|

s_k= h_k(g_ (z_1),\ldots, g_ (z_m))= \rho_k(\boldsymbol\theta;z_1,\ldots,z_m).

|}
For each sample seed

\

a vector of parameters

\boldsymbol\theta

is obtained from the solution of the above system with

s_i

fixed to the observed values.
Having computed a huge set of compatible vectors, say ''N'', the empirical marginal distribution of

\Theta_j

is obtaineded by:

:^N\fracI_(\breve\theta_)
| width=10% align="center" | (2)
|}
where

\breve\theta_

is the j-th component of the generic solution of (1) and where

I_(\breve\theta_)

is the indicator function of

\breve\theta_

in the interval

(-\infty,\theta].

Some indeterminacies remain if ''X'' is discrete and this we will be considered shortly.
The whole procedure may be summed up in the form of the following Algorithm, where the index

\boldsymbol\Theta

\boldsymbol s_

denotes the parameter vector from which the statistics vector is derived.

抄文引用元・出典: フリー百科事典『ウィキペディア（Wikipedia）』
■ウィキペディアで「Bootstrapping populations」の詳細全文を読む

スポンサードリンク

翻訳と辞書 : 翻訳のためのインターネットリソース